Exploiting Domain Knowledge via Grouped Weight Sharing with Application to Text Categorization
نویسندگان
چکیده
A fundamental advantage of neural models for NLP is their ability to learn representations from scratch. However, in practice this often means ignoring existing external linguistic resources, e.g., WordNet or domain specific ontologies such as the Unified Medical Language System (UMLS). We propose a general, novel method for exploiting such resources via weight sharing. Prior work on weight sharing in neural networks has considered it largely as a means of model compression. In contrast, we treat weight sharing as a flexible mechanism for incorporating prior knowledge into neural models. We show that this approach consistently yields improved performance on classification tasks compared to baseline strategies that do not exploit weight sharing.
منابع مشابه
IDENTIFYING AND RANKING FACTORS AFFECTING SUCCESSFUL IMPLEMENTATION OF KNOWLEDGE MANAGEMENT
In the developed countries, many organizations are regarded a...
متن کاملIDENTIFYING AND RANKING FACTORS AFFECTING SUCCESSFUL IMPLEMENTATION OF KNOWLEDGE MANAGEMENT
In the developed countries, many organizations are regarded a...
متن کاملExploiting Structure and Semantics for Expressive Text Kernels
Several problems in text categorization are too hard to be solved by standard bag-of-words representations. Work in kernel-based learning has approached this problem by (i) considering information about the syntactic structure of the input or by (ii) incorporating knowledge about the semantic similarity of term features. In this paper, we propose a generalized framework consisting of a family o...
متن کاملخوشهبندی اسناد مبتنی بر آنتولوژی و رویکرد فازی
Data mining, also known as knowledge discovery in database, is the process to discover unknown knowledge from a large amount of data. Text mining is to apply data mining techniques to extract knowledge from unstructured text. Text clustering is one of important techniques of text mining, which is the unsupervised classification of similar documents into different groups. The most important step...
متن کاملSome Studies on Chinese Domain Knowledge Dictionary and Its Application to Text Classification
In this paper, we study some issues on Chinese domain knowledge dictionary and its application to text classification task. First a domain knowledge hierarchy description framework and our Chinese domain knowledge dictionary named NEUKD are introduced. Second, to alleviate the cost of construction of domain knowledge dictionary by hand, we use a boostrapping-based algorithm to learn new domain ...
متن کامل